Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/80063/24

Hardware–Software Co-Design of an Audio Feature Extraction Pipeline for Machine Learning Applications

01-Jul-2024 Research 2024 : July-September

Jure Vreča, Ratko Pilipović, Anton Biasizzo

Keyword spotting is an important part of modern speech recognition pipelines. Typical contemporary keyword-spotting systems are based on Mel-Frequency Cepstral Coefficient (MFCC) audio features, which are relatively complex to compute. Considering the always-on nature of many keyword-spotting systems, it is prudent to optimize this part of the detection pipeline. We explore the simplifications of the MFCC audio features and derive a simplified version that can be more easily used in embedded applications. Additionally, we implement a hardware generator that generates an appropriate hardware pipeline for the simplified audio feature extraction. Using Chisel4ml framework, we integrate hardware generators into Python-based Keras framework, which facilitates the training process of the machine learning models using our simplified audio features.

How to Cite this Article
Attribution/ CC Compliant Citation: Vreča, Jure, Ratko Pilipović, and Anton Biasizzo. "Hardware -Software Co-Design of an Audio Feature Extraction Pipeline for Machine Learning Applications." Electronics 13.5 (2024): 875. https://doi.org/10.3390/electronics13050875 http://creativecommons.org/licenses/by/4.0/ Some formatting elements, header, footer, logos, dates and pagination were modified while adapting this article.
Download Full Text

Call Us: +4 (800) 888-0008

Inventi Impact: Audio, Speech & Music Processing

Articles

Inventi:easm/80063/24

Hardware–Software Co-Design of an Audio Feature Extraction Pipeline for Machine Learning Applications

How to Cite this Article

Links

Contact Us